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AUG 1 4 2002 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. TECH CENTER 1 600/2900 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR C'RF SUBMISSION QUESTIONS, PLEASE CONTACT MARKSPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTED 2.1 e-mail help: patM211help@nspto.gov or phone 703-306-41 19 (R. Was) 
PATENTIN 3.0 e-mail help: patnn3Snelp@iuispto.gov or phone 703-306-4119 (R. Was) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHRmi 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
tottp;//wwWoMspto.gov/welb/offfices/pac/checlk:er 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for die 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 



1. EFS-Bio (<http://wwwonspto.gov/ebc/efs/downloads/docnmente , EES Submission 
User Mannal - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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3 <110> APPLICANT: Vaisvila, Romualdus 

4 Morgan, Richard D. 

5 Kucera, Rebecca B. 

6 Claus, Toby B. 

7 Raleigh, Elisabeth A. 

9 <120> TITLE OF INVENTION: Method For Cloning And Producing The Msel Restriction 
Endonuclease 



11 


<130> 


FILE REFERENCE: NEB- 181 




















13 


<140> 


CURRENT APPLICATION NUMBER: US 09/689, 


343C 










14 


<141> 


CURRENT FILING DATE: 2000- 


10-12 
















16 


<160> 


NUMBER OF 


SEQ 


ID NOS: 


9 




















18 


<170> 


SOFTWARE : 


Patentln version 3 . 1 
















20 


<210> 


SEQ ID NO 


1 


























21 


<211> 


LENGTH: 903 


























22 


<212> 


TYPE: 


DNA 




























23 


<213> 


ORGANISM: 


Micrococcus 


sp. 




















Z D 


<220> 


FEATURE : 




























26 


<221> 


NAME/KEY : 


CDS 


























27 


<222> 


LOCATION: 


(1). 


. (900) 






















28 


<223> 


OTHER 


INFORMATION: 
























30 


<400> 


SEQUENCE : 


1 


























32 


atg 


cct 


ate 


teg 


acc 


gtc 


tgg 


acg 


ccg 


gac 


gga 


gac 


gac 


etc 


ate 


gtg 


48 


33 


Met 


Pro 


He 


Ser 


Thr 


Val 


Trp 


Thr 


Pro 


Asp 


Gly 


Asp 


Asp 


Leu 


He 


Val 




34 


1 










5 










10 










15 






36 


gag 


gcg 


gac 


aac 


etc 


gat 


ttc 


att 


caa 


acg 


etc 


ccc 


gac 


gcg 


age 


ttc 


96 


37 


Glu 


Ala 


Asp 


Asn 


Leu 


Asp 


Phe 


He 


Gin 


Thr 


Leu 


Pro 


Asp 


Ala 


Ser 


Phe 




38 










20 










25 










30 








40 


cga 


atg 


ate 


tac 


ate 


gat 


ccg 


ccg 


ttc 


aac 


aca 


ggg 


cga 


acg 


cag 


egg 


144 


41 


Arg 


Met 


He 


Tyr 


He 


Asp 


Pro 


Pro 


Phe 


Asn 


Thr 


Gly 


Arg 


Thr 


Gin 


Arg 




42 








35 










40 










45 










44 


ctt 


cag 


teg 


etc 


aag 


acg 


acc 


cgc 


teg 


gtc 


aca 


ggg 


teg 


cga 


gtc 


ggc 


192 


45 


Leu 


Gin 


Ser 


Leu 


Lys 


Thr 


Thr 


Arg 


Ser 


Val 


Thr 


Gly 


Ser 


Arg 


Val 


Gly 




46 




50 










55 










60 












48 


ttc 


aaa 


ggc 


cag 


acg 


tac 


gac 


acg 


gtc 


aag 


age 


act 


ctg 


cac 


teg 


tat 


240 


49 


Phe 


Lys 


Gly 


Gin 


Thr 


Tyr 


Asp 


Thr 


Val 


Lys 


Ser 


Thr 


Leu 


His 


Ser 


Tyr 




50 


65 












70 










75 










80 




52 


gac 


gac 


get 


ttc 


acc 


gac 


tat 


tgg 


teg 


ttc 


etc 


gaa 


ccg 


cgt 


etc 


ctg 


288 


53 


Asp 




Ala 


Phe 


Thr 


Asp 


Tyr 


Trp 


Ser 


Phe 


Leu 


Glu 


Pro 


Arg 


Leu 


Leu 




54 












85 










90 










95 






56 


gag 


get 


tgg 


egg 


ttg 


etc 


acc 


cct 


gac 


ggc 


gcg 


etc 


tat 


ctt 


cat 


ctg 


336 


57 


Glu 


Ala 


Trp 


Arg 


Leu 


Leu 


Thr 


Pro 


Asp 


Gly 


Ala 


Leu 


Tyr 


Leu 


His 


Leu 




58 










100 










105 










110 








60 


gat 


tac 


cgc 


gag 


gtt 


cac 


tac 


gec 


aag 


gtc 


gtc 


etc 


gac 


gcg 


atg 


ttc 


384 


61 


Asp 


Tyr 


Arg 


Glu 


Val 


His 


Tyr 


Ala 


Lys 


Val 


Val 


Leu 


Asp 


Ala 


Met 


Phe 
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62 






115 










120 










125 










64 


craa 


cqc 


gaa 


age 


ttc 


ctg 


aac 


gag 


ctg 


ate 


tgg 


gcg 


tac 


gac 


tac 


ggc 


432 


65 


Gly 


Arg 


Glu 


Ser 


Phe 


Leu 


Asn 


Glu 


Leu 


He 


Trp 


Ala 


Tyr 


Asp 


Tyr 


Gly 




66 




130 










135 










140 












68 


gcg 


cgc 


teg 


aag 


age 


aag 


tgg 


ccc 


acc 


aag 


cac 


gac 


aac 


ate 


etc 


gtg 


480 


69 


Ala 


Arg 


Ser 


Lys 


Ser 


Lys 


Trp 


Pro 


Thr 


Lys 


His 


Asp 


Asn 


He 


Leu 


Val 




70 


145 










150 










155 










160 




72 


tat 


crtq 

_7 ^ -J 


aaq 


crac 


ccg 


aac 


aac 


tac 


gtc 


tgg 


aac 


ggt 


cag 


gat 


gta 


gat 


528 


73 


Tyr 


Val 


Lys 


Asp 


Pro 


Asn 


Asn 


Tyr 


Val 


Trp 


Asn 


Gly 


Gin 


Asp 


Val 


Asp 




74 










165 










170 










175 






76 


cgc 


cracr 


ccc 


tac 


atg 


crccr 


ccc 


3 3 7 


etc 


gtt 


aca 


ccc 


gag 


aag 


gta 


gcg 


576 


77 


Arg 


Glu 


Pro 


Tyr 


Met 


Ala 


Pro 


Gly 


Leu 


Val 


Thr 


Pro 


Glu 


Lys 


Val 


Ala 




78 








180 










185 










190 








80 


ctt 


crac 


aacr 


ctg 


ccc 


acc 


crac 


crtc 


tqq 


tag 

_7 _7 


cac 


aca 


ate 


gtt 


ccg 


cct 


624 


81 


Leu 


Gly 


Lys 


Leu 


Pro 


Thr 


Asp 


Val 


Trp 


Trp 


His 


Thr 


He 


Val 


Pro 


Pro 




82 






195 










200 










205 










84 


gcg 


age 


aaa 


gag 


cgc 


acc 


ggg 


tac 


gcg 


aca 


cag 


aag 


ccg 


gtc 


ggc 


ate 


672 


85 


Ala 


Ser 


Lys 


Glu 


Arg 


Thr 


Gly 


Tyr 


Ala 


Thr 


Gin 


Lys 


Pro 


Val 


Gly 


He 




86 




210 










215 










220 












88 


ate 


cgt 


cgc 


atg 


att 


cag 


gcg 


age 


age 


aat 


gaa 


ggc 


gac 


tgg 


gtt 


ctg 


720 


89 


He 


Arg 


Arg 


Met 


He 


Gin 


Ala 


Ser 


Ser 


Asn 


Glu 


Gly 


Asp 


Trp 


Val 


Leu 




90 


225 










230 










235 










240 




92 


gat 


ttc 


ttc 


get 


ggt 


agt 


ggg 


acg 


acc 


ggc 


gee 


gcg 


gee 


cgc 


cag 


etc 


768 


93 


Asp 


Phe 


Phe 


Ala 


Gly 


Ser 


Gly 


Thr 


Thr 


Gly 


Ala 


Ala 


Ala 


Arg 


Gin 


Leu 




94 










245 










250 










255 






96 


gga 


cgc 


cgt 


ttt 


gtg 


etc 


gta 


gac 


gtc 


aac 


cca 


gaa 


gca 


ate 


gcg 


gta 


816 


97 


Gly 


Arg 


Arg 


Phe 


Val 


Leu 


Val 


Asp 


Val 


Asn 


Pro 


Glu 


Ala 


He 


Ala 


Val 





98 260 265 270 

100 atg gca aaa egg ttg gat gac ggg gca ttg gac acc age gtg acg ate 864 

101 Met Ala Lys Arg Leu Asp Asp Gly Ala Leu Asp Thr Ser Val Thr He 

102 275 280 285 

104 gtg cag act ccc cag agt gac cca cga acc gac gga tga 903 

105 Val Gin Thr Pro Gin Ser Asp Pro Arg Thr Asp Gly 

106 290 295 300 

109 <210> SEQ ID NO: 2 

110 <211> LENGTH: 300 

111 <212> TYPE: PRT 

112 <213> ORGANISM: Micrococcus sp. 
114 <400> SEQUENCE: 2 

116 Met Pro He Ser Thr Val Trp Thr Pro Asp Gly Asp Asp Leu He Val 

117 15 10 15 

119 Glu Ala Asp Asn Leu Asp Phe He Gin Thr Leu Pro Asp Ala Ser Phe 

120 20 25 30 

122 Arg Met He Tyr He Asp Pro Pro Phe Asn Thr Gly Arg Thr Gin Arg 

123 35 40 45 

125 Leu Gin Ser Leu Lys Thr Thr Arg Ser Val Thr Gly Ser Arg Val Gly 

126 50 55 60 

12 8 Phe Lys Gly Gin Thr Tyr Asp Thr Val Lys Ser Thr Leu His Ser Tyr 

129 65 70 75 80 
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131 


Asp 


Asp 


Ala 


Phe 


Thr 


Asp 


Tvr 

A Jr ± - 


Tro 

tr 


Ser 


Phe 


Leu 


Glu 


Pro 


Arcr 


Leu 


Leu 


132 












85 










90 










95 




134 


Glu 


Ala 


Tro 

tr 


Ara 


Leu 


Leu 


Thr 


Pro 


Asp 


Gly 

w J; 


Ala 


Leu 


Tvr 

JT 


Leu 


His 


Leu 


135 










100 










105 










110 






137 


Asp 


Tyr 


Arg 


Glu 


Val 


His 


Tvr 


Ala 


Lys 


Val 


Val 


Leu 


Asp 


Ala 


Met 


Phe 


138 








115 










120 










125 








140 


Gly Arg 


Glu 


Ser 


Phe 


Leu 


Asn 


Glu 


Leu 


lie 


TrD 

tr 


Ala 


Tvr 


Asp 


Tvr 

A Jr - 1 - 


Glv 

Jr 


141 




130 










135 










140 










143 


Ala 


Arg 


Ser 


Lys 


Ser 


Lys 


Tro 


Pro 


Thr 


Lys 


His 


Asp 


Asn 


lie 


Leu 


Val 


144 


145 












150 










155 










160 


146 


Tyr 


Val 


Lys 


Asp 


Pro 


Asn 


Asn 


Tvr 

A Jr 


Val 


Tro 

-Lip 


Asn 


Glv 

VJ -L jY 


Gin 


Asp 


Val 


Asp 


147 












165 










170 










175 




149 


Arg 


Glu 


Pro 




Met 


Ala 


Pro 


Glv 


Leu 


Val 


Thr 


Pro 


Glu 


Lys 


Val 


Ala 


150 










180 










185 










190 






152 


Leu 


Gly 


Lys 


Leu 


Pro 


Thr 


Asp 


Val 


tr 


TrD 

tr 


His 


Thr 


lie 


Val 


Pro 


Pro 


153 








195 










200 










205 








155 


Ala 


Ser 


Lys 


Glu 


Arcr 


Thr 


Glv 

\J -L. Jf 


Tvr 


Ala 


Thr 


Gin 


Lys 


Pro 


Val 


Gly 


lie 


156 




210 










215 










220 










158 


He 


Arg 


Arg 


Met 


He 


Gin 


Ala 


Ser 


Ser 


Asn 


Glu 


Gly 


Asp 

tr 


Trp 


Val 


Leu 


159 


225 












230 










235 










240 


161 


Asp 


Phe 


Phe 


Ala 


Gly 


Ser 


Glv 


Thr 


Thr 


Glv 


Ala 


Ala 


Ala 


Arcr 


Gin 


Leu 


162 












245 










250 










255 




164 


Gly Arg 


Arg 


Phe 


Val 


Leu 


Val 


Asp 


Val 


Asn 


Pro 


Glu 


Ala 


lie 


Ala 


Val 


165 










260 










265 










270 






167 


Met 


Ala 


Lys 


Arg 


Leu 


Asp 


Asp 


Gly 


Ala 


Leu 


Asp 


Thr 


Ser 


Val 


Thr 


He 


168 








275 










280 










285 








170 


Val 


Gin 


Thr 


Pro 


Gin 


Ser 


Asp 


Pro 


Arg 


Thr 


Asp 


Glv 










171 




290 










295 










300 










173 


<210> 


SEQ ID NO 


3 
























174 


<211> 


LENGTH: 1236 
























175 


<212> 


TYPE: 


DNA 


























176 


<213> 


ORGANISM: 


Unknown 






















178 


<220> 


FEATURE : 


























179 


<223> 


OTHER 


INFORMATION 


: Environmental DNA 












181 


<220> 


FEATURE : 


























182 


<221> 


NAME/KEY : 


CDS 
























183 


<222> 


LOCATION: 


(1). 


. (1233) 




















184 


<223> 


OTHER 


INFORMATION 






















186 


<220> 


FEATURE : 


























187 


<221> 


NAME/KEY : 


misc_f eature 


















188 


<222> 


LOCATION: 


(198).. 


(198) 




















189 


<223> 


OTHER 


INFORMATION 


: Xaa 


= any amino acid 










191 


<220> 


FEATURE : 


























192 


<221> 


NAME/KEY : 


misc_f eature 


















193 


<222> 


LOCATION: 


(594).. 


(594) 




















194 


<223> 


OTHER 


INFORMATION 


: N= 


G, A, C 


or T 












196 


<400> 


SEQUENCE : 


3 
























198 


atg 


cct 


aca 


ctg 


gat 


tgg 


ccc 


ggt 


aaa 


cag 


tta 


age 


ttc 


cca 


cca 


get 


199 


Met 


Pro 


Thr 


Leu 


Asp 


Trp 


Pro 


Gly 


Lys 


Gin 


Leu 


Ser 


Phe 


Pro 


Pro 


Ala 
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200 


1 








5 










10 










15 






202 


acc 


tec 


ttg 


cat 


ctg 


gag 


agt 


gtg 


gtc 


act 


gag 


gga 


gcg 


gag 


tea 


ccg 


96 


203 


Thr 


Ser 


Leu 


His 


Leu 


Glu 


Ser 


Val 


Val 


Thr 


Glu 


Gly 


Ala 


Glu 


Ser 


Pro 




204 








20 










25 










30 








206 


cct 


aat 


cat 


eta 


att 


tgg 


gcg 


gac 


aac 


ctg 


ccg 


eta 


atg 


gta 


gat 


ttg 


144 


207 


Pro 


Asn 


Arg 


Leu 


He 


Trp 


Ala 


Asp 


Asn 


Leu 


Pro 


Leu 


Met 


Val 


Asp 


Leu 




208 






35 










40 










45 










210 


ttg 




aaa 


tat 


aaa 


aaa 

j z* zj 


aaa 


ate 


gat 


ctg 


ate 


tac 


gee 


gat 


ccc 


cct 


192 


211 


Leu 


Ala 


Glu 


Tyr 


Glu 


Gly 


Lys 


He 


Asp 


Leu 


He 


Tyr 


Ala 


Asp 


Pro 


Pro 




212 




50 










55 










60 












214 


ttt 


ttt 


acg 


gat 


cat 


act 


tat 


aca 

ZJ w ZJ 


aca 


caa 


att 


aat 

ZJ ZJ 


cat 


aaa 

J Zi Zf 


gag 


gat 


240 


215 


Phe 


Phe 


Thr 


Asp 


Ara 


Thr 


Tyr 


Ala 


Ala 


Arg 


He 


Gly 


His 


Gly 


Glu 


Asp 




216 


65 










70 










75 










80 




218 


teg 


cgt 


cgt 


cca 


caa 


acc 


taa 
*-y y 


cag 


ctt 


aca 

Zf 


aaa 


aaa 


tat 


aca 


aac 


aaa 

ZJ ZJ 


288 


219 


Ser 


Ara 


Ara 


Pro 


Gin 


Thr 


Trp 


Gin 


Leu 


Ala 


Glu 


Gly 


Tyr 


Thr 


Asp 


Glu 




220 










85 










90 










95 






222 




aag 


gat 


tta 


aat 


gaa 


tac 


ctg 


gac 


ttc 


ctt 


tat 


cca 


cac 


eta 

ZJ 


ata 

ZJ 


336 


223 


Trp 


Lys 


Asp 


Leu 


Asp 


Glu 


Tyr 


Leu 


Asp 


Phe 


Leu 


Tyr 


Pro 


Arg 


Leu 


Val 




224 








100 










105 










110 








226 


ctg 


atg 


tat 


cga 


ctg 


ctg 


aca 

z> ^ 


cca 


cac 


aaa 

ZJ ZJ 


aca 


etc 


tac 


ttg 


cac 


ctg 


384 


227 


Leu 


Met 


Tyr 


Arg 

* »-*- ZJ 


Leu 


Leu 


Ala 


Pro 


His 


Gly 


Thr 


Leu 


Tyr 


Leu 


His 


Leu 




228 






115 










120 










125 










230 


gac 




cac 


acc 


aat 


acc 

z* 


tac 


ata 


cat 


gta 


ctg 


ctt 


gat 


gag 


ate 


ttc 


432 


231 


Asp 


Trp 


His 


Ala 


Asn 


Ala 


Tyr 


Val 


Arg 


Val 


Leu 


Leu 


Asp 


Glu 


He 


Phe 




232 




130 










135 










140 












234 


3 3 V 


caa 


cacr 


caa 


ttt 


etc 


aac 


gag 


ate 


gtc 


tgg 


ate 


tat 


cac 


ggc 


ccc 


480 


235 


Gly 


Ara 


Gin 


Arg 


Phe 


Leu 


Asn 


Glu 


He 


Val 


Trp 


He 


Tyr 


His 


Gly 


Pro 




236 


145 










150 










155 










160 




238 


tea 


QCC 


ate 


caa 


cac 


acc 


ttc 


aag 


cgc 


aaa 


cat 


gat 


acc 


ate 


ttg 


gtt 


528 


239 


Ser 


Ala 


lie 


Arq 


Arg 


Ala 


Phe 


Lys 


Arg 


Lys 


His 


Asp 


Thr 


He 


Leu 


Val 




240 










165 










170 










175 






242 


tat 


crtcr 
y *-y 


aaa 


aat 


aaa 


aac 


tat 


aca 


ttc 


aat 


gcg 


gat 


gcg 


gtt 


cgt 


caa 


576 


243 


Tyr 


Val 


Lys 


Gly 


Glu 


Asn 


Xyr 


Thr 


Phe 


Asn 


Ala 


Asp 


Ala 


Val 


Arg 


Gin 




244 








180 










185 










190 








246 


cct 


tac 


cat 


cca 


aac 


acn 


cat 


aaa 


acc 


ttc 


act 


tec 


tec 


ccg 


aag 


gec 


624 


247 


Pro 


Tvr 


His 


Pro 


Ser 


Xaa 


His 


Lys 


Thr 


Phe 


Ala 


Ser 


Ser 


Pro 


Lys 


Ala 




248 






195 










200 










205 










250 


y y 


ttt 


aat 
y y u 


aag 


ata 
y *-y 


ccg 


gat 


ctg 


cag 


cgc 


aac 


aaa 


ata 

ZJ _3 


ccc 


aaa 


gac 


672 


251 


Gly 


Phe 


Gly 


Lys 


Val 


Pro 


Asp 


Leu 


Gin 


Arg 


Gly 


Lys 


Val 


Pro 


Glu 


Asp 




252 




210 










215 










220 












254 


tgg 


tgg 


tat 


ttt 


ccg 


gtc 


gtg 


gec 


cgt 


eta 


cac 


cga 


gaa 


egg 


age 


ggc 


720 


255 


Trp 


Trp 


Tyr 


Phe 


Pro 


Val 


Val 


Ala 


Arg 


Leu 


His 


Arg 


Glu 


Arg 


Ser 


Gly 




256 


225 










230 










235 










240 




258 


tat 


ccg 


act 


caa 


aag 


cct 


caa 


gee 


ttg 


ctg 


gag 


egg 


ate 


ctg 


ctg 


gec 


768 


259 


Tyr 


Pro 


Thr 


Gin 


Lys 


Pro 


Gin 


Ala 


Leu 


Leu 


Glu 


Arg 


He 


Leu 


Leu 


Ala 




260 










245 










250 










255 






262 


tec 


teg 


aac 


gca 


ggc 


gat 


ctg 


gtg 


gca 


gac 


ttc 


ttc 


tgc 


ggc 


tea 


ggg 


816 


263 


Ser 


Ser 


Asn 


Ala 


Gly 


Asp 


Leu 


Val 


Ala 


Asp 


Phe 


Phe 


Cys 


Gly 


Ser 


Gly 




264 








260 










265 










270 
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RAW SEQUENCE LISTING DATE: 07/19/2002 

PATENT APPLICATION: US/09/689 f 34 3C TIME: 14:59:21 

Input Set : A:\NEB-181.ST25.txt 
Output Set: N:\CRF3\07192002\I689343C.raw 

266 aca acc get gtg gtg gca gec cgt ctg gga egg cgc ttc ctg gtc aac 864 

267 Thr Thr Ala Val Val Ala Ala Arg Leu Gly Arg Arg Phe Leu Val Asn 

268 275 280 285 

270 gat gca age tgg cgc gec gtt cat gtg aca cgc aca cgc ttg eta cgc 912 

271 Asp Ala Ser Trp Arg Ala Val His Val Thr Arg Thr Arg Leu Leu Arg 

272 290 295 300 

274 gag gga gta agt ttc act ttt gaa cgc cag gaa act ttt act eta cct 960 

275 Glu Gly Val Ser Phe Thr Phe Glu Arg Gin Glu Thr Phe Thr Leu Pro 

276 305 310 315 320 

278 ate cag cca ctt cca cca gat tgg ttg ate ate gec gag gag cag att 1008 

279 lie Gin Pro Leu Pro Pro Asp Trp Leu lie lie Ala Glu Glu Gin lie 

280 325 330 335 

282 cgc etc caa gca ccc ttt etc gta gat ttt tgg gaa gtg gac gat caa 1056 

283 Arg Leu Gin Ala Pro Phe Leu Val Asp Phe Trp Glu Val Asp Asp Gin 

284 340 345 350 

286 tgg gat ggc aaa ate ttc cgc age cgt cat caa ggc tta cgc tec cgc 1104 

287 Trp Asp Gly Lys lie Phe Arg Ser Arg His Gin Gly Leu Arg Ser Arg 

288 355 360 365 

290 ctt cag gag cag gcg ccg etc tct eta cca ttg acc ggg aat gga ctg 1152 

291 Leu Gin Glu Gin Ala Pro Leu Ser Leu Pro Leu Thr Gly Asn Gly Leu 

292 370 375 380 

294 ttg tgt gta egg gta gtg age cgt gaa ggg gaa tac tat gag ttc aca 1200 

295 Leu Cys Val Arg Val Val Ser Arg Glu Gly Glu Tyr Tyr Glu Phe Thr 

296 385 390 395 400 

298 ggt cga gee gat age cct cac ccc gta teg ttt tga 1236 

299 Gly Arg Ala Asp Ser Pro His Pro Val Ser Phe 

300 405 410 

303 <210> SEQ ID NO: 4 

304 <211> LENGTH: 411 

305 <212> TYPE: PRT 

306 <213> ORGANISM: Unknown 

308 <220> FEATURE: 

309 <223> OTHER INFORMATION: Environmental DNA A 

311 <220> FEATURE: *>{0^ 

312 <221> NAME/KEY: misc_f eature -/fW*^ 

313 <222> LOCATION: (198).. (198) * ^ 



... 

314 <223> OTHER INFORMATION: Xaa = any amino acid / ^ qq. 



316 .<220> FEATURE 

317 <221> NAME/KEY: misj ^oatu xe^ 1 0 (p 

318 <222> LOCATION/ 




319 <223> OTHER IN 

321 <400> SEQUENCE: 

323 Met Pro Thr Leu Asp Trp^Tu^&^^s^J^ Phe Pro Pro Ala 

324 1 5 10 15 

326 Thr Ser Leu His Leu Glu Ser Val Val Thr Glu Gly Ala Glu Ser Pro 

327 20 25 30 

329 Pro Asn Arg Leu lie Trp Ala Asp Asn Leu Pro Leu Met Val Asp Leu 

330 35 40 45 

332 Leu Ala Glu Tyr Glu Gly Lys lie Asp Leu lie Tyr Ala Asp Pro Pro 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/09/689, 343C 



DATE: 07/19/2002 
TIME: 14:59:22 



Input Set : A:\NEB-181.ST25.txt 

Output Set: N:\CRF3\07192002\I689343C.raw 



Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:3; N Pos . 594 
Seq#:3; Xaa Pos. 198 
Seq#:4; Xaa Pos. 198 
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VERIFICATION SUMMARY DATE: 07/19/2002 

PATENT APPLICATION: US/09/689 f 34 3C TIME: 14:59:22 

Input Set : A:\NEB-181.ST25.txt 

Output Set: N:\CRF3\07192002\I689343C.raw 

L:246 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 after pos.:576 
L:247 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 after pos.:624 
L:359 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 after pos.:192 
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